HUBERT  SPACE 


by 


JOHN  DAVID  PERINE 


B.  S.,  Washburn  University,   196ij 


A  MASTER'S  REPORT 


submitted  in  partial  fulfillment  of  the 

requirements  for  the  degree 

MASTER  OF  SCIENCE 

Department  of  Mathematics 

KANSAS  STATE  UNIVERSITY 
Manhattan,   Kansas 

1965 

Approved  by: 


for  Professor  ^» 


A  4 

P' 


ii 


TABLE  OF  CONTENTS 

INTRODUCTION   1 

VECTOR  SPACES 1 

PRE-HILBERT  SPACES   .    .' 8 

ORTHOGONAL  VECTORS .    .  15 

HILBERT  SPACE     21 

REFERENCES 33 

ACKNOWLEDGMENT 3U 


INTRODUCTION 

The  theory  of  Hilbert  space  grew  out  of  an  infinite  dimensional 
vector  space  which  was  first  extensively  studied  by  the  German  mathe- 
matician David  Hilbert  in  his  work  on  the  theory  of  integral  equations. 
This  space  is  the  classical  Hilbert  space  1  ,  which  is  described  in 
Example  7  below.  It  was  abstracted  by  J.  von  Neumann  to  the  variety 
of  linear  vector  space  known  as  Hilbert  space. 

The  infinite-dimensionality  of  classical  Hilbert  space  has  brought 
about  the  convention  of  reserving  the  terra  Hilbert  space  for  infinite- 
dimensional  spaces  satisfying  certain  conditions,  while  finite-dimensional 
spaces  satisfying  the  same  conditions  are  termed  Euclidian  spaces.  How- 
ever, most  of  the  definitions  and  theorems  contained  in  this  report  apply 
to  either  type  of  space;  therefore  this  convention  will  not  be  employed. 
That  is,  Euclidian  space  will  be  termed  finite-dimensional  Hilbert  space. 

In  the  following  pages,  the  structure  of  Hilbert  space  will  be  de- 
veloped as  a  special  type  of  linear  vector  space,  and  some  of  the  general 
properties  of  this  space  will  be  presented. 

VECTOR  SPACES 

A  vector  space  V(  9^)   over  a  field  /'is  a  set  of  elements  x,  y,  z,  ... 
satisfying  the  following  axions: 

(1)  r     forms  an  additive  Abelian  group,  where  the  operation,  denoted 
"+",  is  from  Kx  ^into  V.     That  is,  for  x,y,z  e  JT , 

A,    x  +  y»y  +  x, 

A2    x  +  (y  +  z)  =  (x  +  y)  +  z  , 


A.   there  is  a  Q,  €.  Ksuch  that  x  +  9  =  x  for  all  x  €  V , 

Aj    if  x  €.  V~,   then  there  is  a  -x  <£  V  such  that  x  +  (-x)  =  Q   . 
(2)  There  is  an  operation  from  %~x  Kinto  Y ',  denoted  by  juxtaposition 
and  called  scalar  multiplication ,  such  that  for  x,y  £  V  and  \  JU  &  ^r 

M-l    }.(x  +  y)  -  Ax  +  Ay  , 

M2   (a€>//)x  =  Ax  +  /vx  , 

M3    A(//x)  "  (X-/^)x  , 

M.        Ix  =  x  ,  where  1  is  the  multiplicative  identity  of  JF. 

In  the  above  axions,  ©  and  *  denote  addition  and  multiplication, 
respectively,  in  the  field  JF .  A  vector  space  can  be  defined  over  any 
algebraic  field  f  ,  and  is  often  described  by  the  particular  field. 
Thus  V  (^  ),  where  ft    is  the  field  of  real  numbers,  is  called  a  real 
vector  space,  and  V{C),   where  C  is  the  field  of  complex  numbers,  is 
called  a  complex  vector  space.  However,  in  the  sequel,  only  complex 
fields  will  be  used,  and  the  terms  vector  space  and  complex  vector  space 
will  be  equivalent.  The  elements  of  the  field  are  called  scalars.  Thus 
in  the  vector  spaces  to  be  considered  here,  the  term  scalar  will  refer 
to  the  complex  numbers.  By  reserving  the  lower  case  letters,  x,y,z,..., 
to  denote  the  elements  of  V ',   called  vectors,  and  using  lower  case  Greek 
letters  for  scalars,  no  confusion  will  arise  from  the  use  of  +  for  ©  , 
and  juxtaposition  for  • ;  therefore  this  convention  will  be  adopted  in 
the  sequel. 

EXAMPLE  1.  For  n  a  fixed  positive  integer,  let  Z'be  the  set  of  all 
n-tuples  x  =  (>>1  ,  X2,...}  x  n),   where  the  Ai  are  scalars.  If  y  = 

(/Vl>/•<2,•••,^n),  then  define  x  =  y  iff  Xi  =  Ai  for  all  i  =  1,2, ...,n. 
Ai  is  called  the  i-th  component  of  x.  Let  6  »  (0,0, ...,0)  with  a  total 


of  n  components,  -x  ■  ,  (-X^  -  X  2,...,  -  Xn),  x  +  y  =*  (X-^  +  //,  , 
X2  +  /Y2,.-.,  Xn  +  /fn),  and  Ax=  (X  ArX  A2»..<.  XXn).  Then  >- 
satisfies  axions  A, j  and  M  .  and  hence  is  a  vector  space,  referred  to 
as  the  vector  space  of  n-tuples. 

EXAMPLE  2.   Let  V  consist  of  the  set  of  all  scalar -valued  functions 
x  which  are  defined  and  continuous  on  a  closed  interval  [a,b],  together 
with  the  definitions:  for  all  t€  [a,b] ,  x  =  y  iff  x(t)  =  y(t),  9  is  the 
constant  function  9(t)  a  0  ,  -x  =  -x(t)  ,  (x  +  y)(t)=  x(t)  +  y(t)  , 
and  (Ax)(t)  ■  Xx(t).  Since  the  sum  of  two  continuous  functions  and 
the  product  of  two  continuous  functions  result  in  continuous  functions, 
these  definitions  are  suitable  for  defining  a  vector  space. 

EXAMPLE  3.  Let  V  consist  of  the  set  of  all  sequences  of  scalars 
x  =  ( Afc),  k  »  1,  2,  ...,  all  of  whose  terms  are  zero  from  some  index  m 
on  out.  That  is,  x  €  1/  implies  that  there  is  an  integer  m  such  that 
X±  =  o  for  i*  m.  If  y  -  (#k),  define  x  »  j   iff  X.  -  JU,    for  all  k, 
6  =  (0)  ,  -(Xk)  -  (-Ak),  and  X(Xk)  =  (XXk).  Again  V  with  these 
operations  satisfies  the  axions  for  a  vector  space.  This  space  is  called 
the  vector  space  of  finitely  nonzero  sequences. 

A  few  properties  of  vector  spaces  which  follow  from  the  axions. are 
listed  in  the  theorem  below. 

THEOREM  1.  If  Y  is  a  vector  space  and  if  x,y,z  €  V>   then 

(1)  the  equation  x  +  y  =  z  has  a  unique  solution  y  for  every  x,z£  Y \ 

(2)  if  z  +  z  =  z  ,  then  z  =  6  j 

(3)  A9  -  9  for  all  scalars  X  j 
(I4)  Ox  »  9  for  all  xeV; 


(5)  if  Ax  -  0,  then  either  X  =  0  or  x  =  9  . 
Proof: 

(1)  Let  y  =  (-x)  +  z  .  Then  x  +  y  =  x  +  (  (-x)  +  z)  =  (x  +  (-x)) 
+  z  =  9  +  z=z.  If  also  x  +  y'  =  z,  then  x  +  y  =  x  +  y1,  from  which 
y  ■  y1  by  adding  -x  to  each  side. 

(2)  Since  z  +  ©  =  z,  this  follows  immediately  from  (1). 

(3)  A(9)  =  X(e  +  e)  =  A©  +  \9.,  thus  by  (2),  X9  =  e. 

(U)  0  x  =  (0  +  0)x  ■  0  x  +  0  x.  Thus  by  (2),  Ox  =  9. 
(S)  if  A/  0  ,  then  A_1(  Ax)  =  (  X"1  A)x  =  1  x  =  x  =  6  . 
Note  that  0  x  =  (1  +  (-l))x  =  1  x  +  (-l)x  =  x  +  (-l)x  =»  9  ,  hence 
(-l)x  =  -x  ,  since  the  negative  of  x  is  unique  by  (1)  above. 

'  xl  =  xl  +  x2  +  x3  +  •*•  +  xn>  where  x-j£  V,  a   vector  space, 
and  i  ranges  over  l,2,...,n.  The  associative  law,  A„,  may  be  generalized 
to  a  sum  of  n  terms,  so  that  any  grouping  of  the  summands  yields  the 
same  sum.  The  commutative  law,  A,,  can  also  be  generalized,  so  that  in 

,  xi  ,  any  permutation  of  l,2,...,n  gives  the  same  sum.  Thus  X,  xj_  , 
is  uniquely  defined.  If  for  x  €  Y,   x  =  ^  ^±x±»   where  A  •  ar® 
scalars,  then  x  is  said  to  be  a  linear  combination  of  the  x.  .  For 
example,  in  the  vector  space  of  n-tuples,  let  x.  be  the  vector  all  of 
whose  components  are  zero  except  the  i-th  component,  which  is  one.  Then 
x~   ( A]_,  \2>'">  An)  3  Z-,\±x±   .  If  V  is  a  vector  space,  a  linear 
subspace  of  Y  is  defined  to  be  a  subset  f\    of  V  such  that  (i)  6  e  Y\  , 
(ii)  x  €  71   and  y  G  7l   imply  that  x  +  y  €  71  ,  and  (iii)  x  £  fl   and  A  a 
scalar  imply  that  Ax^.  It  is  evident  that  Y\   is  then  a  vector 
space  in  its  own  right;  axions  A-^  and  M-^  must  be  satisfied,  while 


x  £  71  and  condition  (iii)  give  (-l)x  =  -x  6  Y{   .  Thus  Ai  is  satisfied. 
An  immediate  consequence  of  this  definition  is  that  if  71   is  a  linear 
subspace  of  V*  ,  then  every  linear  combination  of  vectors  in  7\   belongs 
to  K   .  Conversely,  if  XI   is  a  subset  of  V  and  if  every  linear  com- 
bination of  vectors  in  71    belongs  to  Yl   ,  then  71    automatically  satisfies 
conditions  (ii)  and  (iii)  of  the  definition  of  a  linear  subspace  of  If  . 

,  Ox^,  where  xi€?7,  condition  (i)  is  also  satisfied, 
and  71    is  therefore  a  linear  subspace  of  l/~ .  Thus  if  A   is  any  subset 
of  V~   ,  the  set  which  consists  of  all  linear  combinations  of  vectors 
in  JL  ,  denoted  [J-\    ,  is  a  linear  subspace  of  If ,   and  is  said  to  be 
the  linear  subspace  generated  by  ^d   .  If  a  vector  x  can  be  expressed 
as  a  linear  combination  of  the  elements  of  d   ,  then  x  is  said  to  be 
linearly  dependent  on  J.   .  Similarly,  a  vector  x  is  said  to  be  linearly 
independent  of  A   if  x  cannot  be  expressed  as  a  linear  combination  of 
the  vectors  in  A   .  If  X  is  a  set  of  vectors  such  that  no  vector  in 
i  is  a  linear  combination  of  the  other  vectors  in  A   ,  then  J-   is  said 
to  be  a  linearly  independent  set  of  vectors.  For  brevity,  the  term 
linear  is  usually  not  written.  Thus  in  the  vector  space  V  of  3-tuples 
(Example  1),  J-   =  {(0,0,1), (0,1,0), (1,0,0)}  is  an  independent  set. 
Furthermore,  if  x  £  V,   x  =  (  A1?  A2>  ^3)  >   then  x  ■  ZfXjXi  where 
x^  is  the  element  of  A   whose  i-th  component  is  1  and  all  other  com- 
ponents zero.   Thus  J   generates  V  ,  and  every  vector  in  V   is  dependent 
on  A.  . 

A  vector  space  Y   is  said  to  be  finitely  generated  if  there  exists  a 
finite  set  x1,x2,.-,.,3^  of  vectors  which  generates  V .     A  set  of  vectors 
7~   is  a  basis  for  a  vector  space  V   provided  (i)  ^"generates  V ,   and 


(ii)  j    is  independent. >    It  can  be  shown  that  every  finitely  generated 
vector  space  has  a  basis,  and  that  the  number  n  of  vectors  in  any  basis 
is  unique  for  ]/~  .     This  number  is  called  the  dimension  of  V  ,  and  V  is 
said  to  be  finite-dimensional.  If  V  is  not  finitely  generated,  then  V 
is  said  to  be  infinite-dimensional.  Thus  the  vector  space  of  3-tuples 
has  dimension  three.  On  the  other  hand,  the  vector  space  of  finitely 
nonzero  sequences,  (Example  3),  is  infinite-dimensional. 

A  complex  vector  space  1P  is  said  to  be  a  pre-Hilbert  space  provided 
there  is  an  operation  from  ~P  XT  into   C   ,  the  complex  field,  denoted  by 
(x[y),  there  x,ye  "P  ,  and  such  that  this  operation  satisfies  the  con- 
ditions: 

P,   (y|x)  ■  (x|y),  where  X  denotes  the  conjugate  of  A  , 

P2   (x  +  y|z)  =  (x[z)  +  (yfz)  , 

P3   (  Ax|y)  =  A(x|y)  ,  and 

P^   (xlx)>0  when  x  /  6  . 

This  operation  is  called  an  inner  product  or  a  scalar  product, 
the  latter  leading  to  a  convenient  verbalization  of  (x|y):  "x  scalar  y". 
Before  considering  the  properties  possessed  by  the  inner  product,  examples 
of  inner  products  defined  on  the  vector  spaces  already  considered  will 
be  given. 

EXAMPLE  h.     Let  i>   be  the  vector  space  of  n-tuples,  and  x  =  (A-,,  ...,\    )  , 

y  =  {^lt...,Mn)   .  Define  (x\7)   =  Z%±JT±   .  Then  (y\x)   -  E,%iTi 

a  L-,  ?^±/U±   ■  (x|y)  ,  thus  ?1   is  satisfied.  Conditions  Pp  ans  P^  follow 
by  properties  of  the  summation,  and  P^  by  the  identity  A  A  -  j  A(  2  ,  A  *C. 


Thus  7  is  a  pre-Hilbert  space,  known  as  n-dimensional  unitary  space, 

and  denoted  Cn  ..  It  should  be  noted  that  C   is  essentially  the  set 

of  complex  numbers  considered  as  a  vector  space,  with  (x(y)  ■  xy  ,  x,y€C. 

EXAMPLE  fj.  Let  r    be  the  vector  space  of  continuous  functions  on 
the  closed  interval  [a,b]  ,  a<b  .  Then  for  x,y  e.  7°,  define  (x|y)  - 
J   x(t)y(t)dt  .  It  will  now  be  shown  that  (x|y)  thus  defined  is  an 
inner  product. 


To  show  that  P1  is  satisfied,  let  H,  x(ti)y(ti)(tjL  -  t.   )  be  one 
of  the  approximating  sums  of  the  integral  which  defines  (x|y).  Then 
(y|x)  ■   /  x(t)y(t)dt  has  the  corresponding  approximating  sum 

Zn 
i  x(ti)y(ti)(ti  -  t^).  Now  tA  -  ti_1  ,  being  a  real  number,  is  its 

own  conjugate.  Thus 


(  CxftiJyft^t.  -  tw)  =  2>(ti)y(ti)(t.  -  tul). 

Since  the  defining  integral  is  composed  of  the  limit  of  such  sums,  it 
follows  that  (j\x)   -  (xfy)  ,  or  equivalent^,  (y\x)   =  (x|y).  ?2   follows 
by  the  linearity  of  the  Riemann  integral  with  respect  to  its  integrand, 
and  P^  from  the  property  that  J^   Af(z)dz  ■  Xjf(z)dz   .  If  x(t)  ^  0  on 
[a,b]  ,  then  there  is  an  £>0  such'  that  for  some  subinterval  [a1,  b  ],  a,<b  , 
of(a,b]  ,  tefapbj  implies  that  jx(t)|>V£  .  Thus  /'|x(t)|2dt  fc/A£  2dt 

£  2(b1  -  &1)  >o.  This  proves  that  (x(x)  =  ^|x(t)]  2dt  >0  if 
x  ^  9,  and  P,  is  satisfied.  Thus  the  vector  space  1°  forms  a  pre- 
Hilbert  space. 

EXAMPLE  6.  Let  T  be  the  vector  space  of  finitely  nonzero  sequences. 
If  x  -  (Ak)  and  y  =  (/Uk),   define  (x|y)  =  I^\kJTk     •  Since  the  terms 


of  this  sum  are  all  zero  from  some  term  on,  this  sum  is  finite  and  well- 
defined.  The  finiteness  of  the  sum  establishes  P-j,  ,  and  P,  follows 
from  the  fact  that  >kAk  "  j  Ak|  '"  >  0  for  some  k  when  x  ■  (Xk)  4     8  « 

PRE-HILBERT  SPACES 

Some  properties  of  a  pre-Hilbert  space  implied  by  the  inner  product 
are  listed  in  the  following  theorem. 

THEOREM  2.  In  a  pre-Hilbert  space  "f3 ,  if  x,y,z  €  "P  ,   then 

(1)  (x|y  +  a)  -  (x|y)  +  (x|z)  ; 

(2)  (x|Ay)  =   A(x|y)  ; 

(3)  (6ly)  =  (x[e)  =  0  j 

(M  (x  -  y|z)  =  (xlz)  -  (y|z)  and  (xly  -  z)  =  (x|y)  -  (x[z)  ; 
(5)  if  (x|z)  =  (y|z)  for  all  z,  then  x  »  y  . 
Proof: 


(1)   (xly  +  z)  =  (y  +  z\x)  by  ?1  , 


[(y|x)  +  (zlx)]   by  P2  , 


(ylx)  +  (zlx) 

(x(y)  +  (x|z)  by  P-L  . 


(2)  (xl>y)«  (TyU)  =  [>(ylx)]   by  Px  and  ?y 

-  A(y|T) 

=  A(x|y)  by  P1   . 


(3)     (6(y)  =  (.0  +  6|y)  =   (0|y)  +  (e(y)  ,  which  implies  that  (6 |y)  ■  0  . 

Similarly,    (x\6)  =»   (x)0  +  0)  =   (xie)  +   (x|e),   and   (x\0)  =  0  follows. 

(h)     Consider  (x  +  A1y|z)  and   (x(y  +\1z),  where   A  ]_  =    A-,   =  -1  . 
Then  apply  P1  and   (1). 

(5)   (x|z)  ■   (y|z)  for  all  z  implies  that   (x  -  yjz)  -   (x|z)  -  (y\z)  -  0 
for  all  z .     Thus  (x  -  yj  x  -  y )  =  0  j  hence  x  -  y  =  0  by  P.  . 

One  of  the  special  classes  of  vector  spaces  is  the  class  of  normed 
vector  spaces.     This  class  has  a  function  defined  on  each  vector  x  €  V  , 
called  the  norm  of  x  and  denoted     ||xfl    ,  which  has  the  properties: 
nx     ||x||     is  strictly  positive,   that   is     (/x||    £    0  with  equality  only  if  x  ■  d  , 
n2     !)AxU  -      |AJ  [lx|    ,   and 
n3        llx  +  yj|      <     llxll       +      j|y|j 

It  is  evident  that  the  definition         ||x(|       =     ^(xfx)   ,  where  x  is 
a  vector  in  a  pre-Hilbert  space   1°  ,    satisfies  ^  and  n     .     That  this 
definition  also  satisfies  n.   is  shown  in  Theorem  6.     Note  that  in  Example  k, 
the  pre-Hilbert  space  of  n-tuples,         llxll       =   (     jT  j     [\    \    )2>  where 
x  ■  (Ai,...,An)  •     Tn  Example  5,   the  pre-Hilbert  space  of  functions 
continuous  on   [a,b]    ,  ({ x  fl        -  (    J    |x(t)   [    2dt)5    ,   and  in  Example  6, 

the  pre-Hilbert   space  of  finitely  nonzero  sequences,    if  x  =   (  \,)  ,   then 

Hx||     =   (     2_(  jXk)    )^      *     The  following  theorems  give  some  of  the 
basic  properties  of    \lx\\    as  defined  for  a  pre-Hilbert  space   1°  . 
THEOREM  3.      (The  Parallelogram  Law.)     If  x,y  6  >°   ,   then 
II  x  +  y||2  +     ||  x  -y||2  -  2  |jx|/   2  +  2||y  II  2     . 
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Proof: 

2 

First     \lx  +  y  II      -   (x  +  y]x  +  y)  -  (x  +  y|x)  +  (x  +  yjy) 

■   (xjx)  +   (yjx)  +   (x/y)  +   (y|y), 

-  I/xll2  +  By//  2  +  (yix)  +  (x|y)  . 
Also     ||x  -y||2  -   (x|x)  -  (y|x)  -  (x|y)  +   (y/y) 

=   llxll   2  +  l/yjf  2  -  (y/x)  -   (x|y)    . 
The  result  follows. 

THEOREM  li.      (Polarization.)     If  x,y  £  f,   then 

(x|y)  =  1/U([|lx  +  yll  2  -  Ux  -y  I2  +  i  ||x  +  iy  |(  2  -  i  )|x  -  iyl(  2] 
Proof: 

Using  the  first  half  of  the  proof  of  Theorem  3,  with  y  replaced  by 
-y>   iy>   and  -iy  successively, 

II  x  -y||2=     «x||2  +  [lyll  2  -   (y[x)  -  (x|y)   , 
II x  +  iy  II  2  =  Ux  ||  2  +  || yll  2  +  i(y|x)  -  i(xfy)  ,  and 
(Ix  -  iy  ||  2  =  /fxll  2  +  llyfl  2  -  i(y|x)  +  i(xfy)   . 
Thus       -  ||x  -  y  ||  2  =  -  (Ixll  2  _  jiyji  2  +   (y(x)  +(x(y)   ^ 

i  II x  +  iyll2-  i  I|x||  2  +  iljyl]  2  -     (y|x)  +   (x|y)   , 


-i  Ux  -  iy  ||2  =  -i  ||x||   2  -  idyll  2  -   (y|x)  +  (xfy)    , 
But       U x  +  yll  2  =  llxll  2  +  llyll  2  +   (yjx)  +   (x|y)  . 
By  summing  the  last  four  identities, 

U(x(y)  -  Ux  +  yll  2  -  ||  x  -  y||  2  +  i  (|x  +  iy  /I  2  -  i  flx  -     iytf  2     , 
as  desired. 

THEOREM  5.      (The  Cauchy-Schwarz  inequality.)     If  x,  y£  P,  then 
|(x|y)|  '<£  llxll    ||yll    . 

(Since  this  theorem  is  a  special  case  of  Theorem  9,  it  will  be  proved 
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after  Theorem  9  is  established.) 

THEOREM  6.   (The  Triangle  inequality. )  If  x,y  €.  P   ,  then 

II  x  tyM  llxll  +  O7 1|     . 
Proof: 
Again    ||  x  +  y  II  2  =*  II  x  II  2  +  llyll  2  +   (x[y)  +   (xly) 

•        -     llxll  2  +  ||y||  2  +  2  ftj(xly)] 

^.  llxll  2  +  0y|(  2  +  2|(xly)|   ,  where  ft.  (  A  ) 
denotes  the  real  part  of  A  .  Now  applying  the  Caucy-Schwarz  inequality, 
II  x   +  yll  2  <  llxll  2  +  l/ylf  2  +  2  (Jxil  II  jl   =  (  Qx//  +  (lyf/  )2  . 

Theorem  6  is  actually  the  statement  n^  of  the  third  property  of  a 

x 
norm  and  justifies  writing  llxll   for  (x|x)2   .  Thus  every  pre-Hilbert 

space  becomes  a  normed  Vector  space  with  llxll  thus  defined.  Consider 

now  the  pre-Hilbert  space  of  complex  numbers,  which  is  essentially 

Example  f>  with  n  =  1.  For  vectors     A  j    jU    €    C   ; 

W\-jU\\  .     {\\-jU\z)L>    '     I  x  -u\  } 

which  also  represents  the  distance  from  the  point  A  to  the  point  JU  in 
the  complex  plane.  This  provides  the  motivation,  and  the  following 
theorem  the  justification,  for  defining  llx  -  y  II  ,  where  x  and  y  are 
vectors  in  a  pre-Hilbert  space  P  ,  to  be  the  distance  between  x  and  y. 
THEOREM  7.  If'x,y,z  ef,    then 

(1)  l/x  -  y/|  £.  0  ,  with  equality  iff  x  =  y  , 

(2)  llx-yll  =  || y  -  xll     ,   and 

(3)  ||  x  -  zj|    <  Hx  -y|(     +  Hy  -  Z||     . 
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Proof : 

(1)  jlxlj    >  0  if  x  /  9  ,   and   (e/e)  -  0  shows  that      Hell    -  0  . 

(2)  [x-y]  -  -  [y  -x]     . 

(3)  Use  Theorem  6,  with  (x  -  z )  =  (x  -  y)  +  (y  -  z)  . 

Any  nonempty  set  X   having  a  distance  function  defined  on  every 

pair  of  elements,  such  that  d  is  strictly  positive,  symmetric,  and 

satisfies  the  triangle  inequality,  is  said  to  be  a  metric  space.  These 

properties  are  precisely  those  of  II x   -  yll  given  by  Theorem  7.  Therefore 

every  pre-Hilbert  space  T   is  a  metric  space,  with  d(x,y)  =  ||  x  -  y  \     , 

for  all  x,y  £  f*  .  Given  a  metric  space  X  ,  a  sequence  {x^}  is  said  to 

be  a  Cauchy  sequence  if  d(xm,xn)-*0  as  m,n  ->  <~  .  If  {xn}  being  a 

Cauchy  sequence  implies  that  there  is  an  x  €  X  such  that  d(x,xn)  ->  0 

as  n  -»•  oo  ,  then  X  is  said  to  be  complete.  Two  of  the  given  examples 

of  pre-Hilbert  spaces  fail  to  be  complete.  Example  5,  the  pre-Hilbert 

space  of  functions  continuous  on  [a,b]  ,  is  incomplete.  The  sequence 

of  functions 

(b  if  -1  &    t  ^  0  , 

xn(t)=Vnt  if  0  <t  <  ~   , 

,iif;  .<t*i     , 

defined  and  continuous  on  [-l,l]  ,  is  a  Cauchy  sequence,  but  it  fails  to 
converge  to  a  continuous  function  defined  on  L-1>1J  •  Again>  Example  6, 
the  pre-Hilbert  space  of  finitely  nonzero  sequences,  is  incomplete.  Con- 
sider the  sequence  (x^)     =     {(1,  ->->•••»-  >0>°> ••• )}  •  Then 
IK+p-*nl|2=  l)(0,....0,i,^,...,i,0,...)||2.   £";:  f 


.     ]2 

•<  I 
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The  series    JL  ,  T2-    is  known  to  be  convergent,   thus    2.^,  J  a    ->    0  as 
n_oo.     Hence  d(xn+p,xn)  =  ^|^     -+0      »«»--• 

Thus     (xl  is  a  Cauchy  sequence.     Assume  jxnj    -»  x  ,   x  €    r   •     Let 
x-   (X1,A2---XN,0,...)       .     Then  choosing  n  I>  N,     |/xn-x|f2  = 

Letting  n^oo,     ^^  ~  j  -^  —  >  ^  J       =0.     This  requires  that     A  k  =      fc 
for  all  k,  which  contradicts  the  hypothesis  that  x  is  finitely  nonzero. 
Thus  no  such  x  exists,   and   (x  J  does  not  converge  to  a  vector  in   7°  . 

A  Hilbert  space  is  defined  to  be  a  complete  pre-Hilbert  space.     Two 
examples  have  been  given  of  pre-Hilbert  spaces  which  fail  to  be  Hilbert 
spaces;  the  following  is  a  complete  pre-Hilbert  space. 

EXAMPLE  7.      (The  Hilbert  Space  l2. )     Let  V  be  the  set  of  all  abso- 
lutely square-summable  sequences   (A^)    ;  that  is,     XLT    I  A  kf       "^ °°  j  ^k**-- 

LEMMA  1.      If  x  -   (Ak)  and  y  "   (fJ^)  are  in  #      >   and  if  A*£,   then 
x  +  y  and   Ax  are  in   Y  . 

Proof: 

Applying  Theorem  3  to  the  complex  numbers 

Uit  +  A<lcl2ti;\it-;"k|i!-2Ma+  2'^'2  • 

Hance    i;uk^.i!  i  2i.-'ix.i*  *  zzrijuj*  <- 

Therefore  the  sum  of  two  elements  of  y   ,  and  any  scalar  multiple  of  an 
element  of  M  is  an  element  of  7/   .  J4   forms  a  vector  space  under  these 


Hi 


operations,  with  6  the  sequence  all  of  whose  terms  are  zero. 

LEMMA  2.   If  x  =  (Xk)  and  y  =  (juk)   belong  to  H   ,  then  the 
series  Y—  7    X^  JJk    is  absolutely  convergent. 

Proof: 

9  1     o  2 

For  all  real  numbers  a  and  b,    (a  -  b)*     iO,   hence  ab  £.  -(a     +  b   )   . 

Thus  !AkAl»      \Xk\\jUk\    *     i(l\ktZ  +IU«lZ)     ■ 


Hence 


zr  WxjjA4-  t  (£."  ix  j'  +l: \juk\*)<~  . 


With  this  lemma  as  justification,   let   (x\y)  =      £.''  Ak/7*  >   since 
this  series  converges.     Then  (x|y)  satisfies  P,   i    .     Thus  ^  is  a  pre- 
Hilbert  space.     To  complete  the  proof  that   V-  is  a  Hilbert  space,   let 
[xnj  be  a  Cauchy  sequence  of  vectors  in  H.     That  is,     \\  xm  -  xn  //  —>  0 
as  m,   n  — >oo  .     Let  xn  =     (  ^     ).     Now  for  each  k, 

Thus  the  sequence   (   \  ,   ),  where  n  is  the  running  index,   that  is,   the 
sequence  of  k-th  components  of  {xnj,   is  a  Cauchy  sequence.     Since  the 
complex  numbers  are  complete   (   X  u  )  ""*   X^asn   -*<*»,  where      \k  *  C-   . 
It  will  now  be  shown  that£xk|  -*(  \  k) ;  that  is,  /^   is  complete. 

LEMMA  3.       Z7~   |"Xk|*    <:  &o     ,   andfxnj  -+  x,   as  n  -^.  ©o    , 
where  x  =   ( Ak). 

Proof : 

Choose  £  >0  and  a  positive  integer  p  such  that  [\xm   -  xn  H 2  ^  £ 
whenever  m,n  2  p  .  Now  for  any  positive  integer  r, 

Zi;.,  I  AT  -A'M  *  ix"-x"l*<£, 
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provided  that    m,n  2p.     Let  m  -*  oo  ;   then        Z_  k.,    (   A  k         Ak    i  <- 

provided  n  >  p  .     Hence       Z  k~  l    I    X  k    -  a' £  1  ^   £         ,  which  implies 
that  the  sequence   (  Ak  -    X**)  is  a  vector  in    Jf  .     Adding  this  to  the 
vector  x>=   ('  X^)   ,  x  =   (  Xk)£  ?/    by  Lemma  1.     Now  from    T?  \\k~  t*  I    "0 
it  follows  that     ||  x  -  xn  I!2   *  £   whenever  n  £  p  .     Thus  (xn)  ->  x  . 
The  Hilbert  space  W  of  absolutely  square-summable   sequences  is 

p 

usually  denoted  1  (. 

ORTHOGONAL  VECTORS 

If  x  and  y  are  vectors  in  a  pre-Hilbert  space  ~P   ,  and  if  (x|y)  =  0  , 
then  x  is  said  to  be  orthogonal  to  y.  This  is  denoted  xly  .  Since 
(x|y)  =  (~ylx~)  and  (x|x)  =  0  iff  x  =  6  ,  it  follows  that  ylx  if  xiy  ,  and 
that  x  J.x  iff  x  =  6  .  Again,  xle  holds  for  all  x  €  j°   .  A  set  J  of 
vectors  is  said  to  be  orthogonal  if  xly  for  any  distinct  x,y£  J-   . 
A  sequence  {xn}  of  vectors  is  defined  to  be  an  orthogonal  sequence  if 
Xj_-Lx.j  whenever  i  /  j  . 

LEMMA  h.  If  x  is  orthogonal  to  each  of  y1,y2, • • • ,Yn,   then  x  is 
orthogonal  to  every  linear  combination  of  the  yk  . 

Proof: 

If  7  -   E?  X^  )then  (x|y)  -  H*  Xk(*lyk)  ■   E?  Ak°  =  °  • 

THEOREM  8.   (The  Pythagorean  Relation.)  If  xly,  then  |Ix  +  y  II 
llxlj  2  +  //y||  2  .  In  general,  if  xx,  ...,xn  form  an  orthogonal  set, 
then   Un^f-      X>kU2  • 
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Proof: 
|| x  +  y  II2  =  (x|x)  +  (x|y)  +  (y|x)  +  (y|y) 


=     |xll  2  +  0  +  0  +     ||y||  2  . 


Assume  inductively  that  J  T. ,  \  |]     =       ]E"'1xkl/  '      '     Then  for 

x  =       Z_  ,  xk  and  y  =  xn  ,   xly  by  Lemma  U.     Thus 

||H%k||2  =    !/x+yJ|2=     i/x//2+|fy||2 

=    |lLT'xk||2+      Kl/2 

-  r;'iixkii2+  }(xn«2 

by  hypothesis.  But  this  last  sum  is  JZ  ,  Hxkll  '  >  which  completes  the 
induction. 

COROLLARY.  If  [xjj  is  an  orthogonal  sequence  of  nonzero  vectors, 
then  the  x^  are  linearly  independent. 

Proof: 

The   sequence   (Anx  }  is  orthogonal.     Hence  if     Z_  |  A^x,    =  0,   then 

byTheoremS,     f  £  "  A^  |j  2  =   0  =     H"  |  >  ^  j2  -   £"|  Xk  |  2  Ij^  |  2 

But       ilxk)I     >    0  for  all  k,   hence     Xk  =  0  for  all  k. 

An  orthogonal  sequence  {x  }  such  that     |)x   ||  =  1  for  all  n  is  said 
to  be  an  orthonormal  sequence.       This  may  be  expressed  by  writing 
(x^x^  =      <5  -jk  >  where     (j  -jk  denotes  the  Knonecker  delta  function. 


EXAMPLE 


2 


8.  In  the  Hilbert  space  1*   ,  let  {e  }   be  the  sequence  whose 
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n-th  terra  is  the  sequence  whose  n-th  term  is  one,  and  all  others  zero: 
e  =  (0,  0, ...,0,1,0, ... )  ,  where  n-1  zeros  precede  the  1  .   (It  is 
clear  that  the  absolute  square-sura  of  each  e,  is  one,  hence  en  €    1 
for  all  k.)  Again,  (ej|ek)  =   (5jk  .  If  x  =  (Ak)  ,  then  (xjej)  =  Aj 
Thus  for  every  vector  x,     Z^/W6^  <   °°  •  That  a  similar  result 
holds  for  any  orthonormal  sequence  of  vectors  is  shown  by  the  following 
theorem. 

THEOREM  9.   (Bessel's  Equality  and  Inequality.)  Let  x-,...,x  be 
an  orthonormal  sequence  in  a  pre-Hilbert  space  r   .  Then  for  every 
vector  x  , 

(1)  Ifx-   rr(x|xk)xk(/2=  Jlxll  2  -   £?l(x|xk)|2  ,  and 

(2)  ;Cl(x(xk)|2  4  /|x||2  . 
Proof: 

By  Theorem  8,  if  Ak^  C  for  k  £  n  ,  then 


n 


Z,ixk|2  sxk«J 


By  expanding  the  norm: 

,2 


*  -  E? xkxk  f  -  iixii 2 .  (  z^  a^ix)  .  (x  i  z;  a^)  ♦  £(*|  >kj  - 
=  (//xii2-  Z?Ak^K)  -   L>xkUk  +  Z"xkxk 

-     ilxll2-    ZZ"|(x|xk)|2+    Z^[(x|xk)(xT^k")   - 


Xk(xfxk)  -  AkWxk)  +    Ak  AkJ 
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=  Gxll  2  -    L?l(xlxk)|2  +      H"[(x(xk)  -\k]  [(x(xk)  -  Aj 

=  ijxii2- ni(xixk)i2  + r:i(xixk)-Aki2  . 

Setting    \k  =   (x|xk)  gives  the  result   (1)   .     Since    (I  x  -    Zr(x)xk)xk  |{      £    0, 
result  (2)  follows  immediately. 

COROLLARY.     If  {xn}  is  an  infinite  orthonormal  sequence  in  a  pre- 
Hilbert  space,   then  2ZrKxlxk)(      ^     ||xll       .     Consequently,    (x|x,  )    — >•  0 
as  n  — *■  oo  , 

Proof: 

This  is  an  immediate  consequence  of  Bessel's  inequality,  since  this 
result  must  hold  for  each  n. 

Since  (Ixll  is  strictly  positive,  equality  will  hold  in  (2)  iff  x  is 
a  linear  combination  of  the  xk  .  That  this  is  sufficient  in  order  to 
have  equality  follows  from  the  implication:  if  x  =  2—  ,   A  i^  , 
then  A,  =  (x(x  )  .  A  generalization  of  this  result  is  given  by  the 
last  step  in  the  proof  ,of  (1),  which  shows  that  the  choice  Ak  =  (  xlxk) 
minimizes  (I  x  -  2Z ,  Xi^k'l  *  Therefore  this  choice  of  the  Ak  provides 
the  best  approximation  of  x  by  a  linear  combination  of  the  x<  . 

Theorem  5,  the  Cauchy-Schwarz  inequality  ((xly)j  <   II  xll  /lyll  ,  will 
now  be  proved. 

Proof  of  Theorem  5.  If  either  x  or  y  is  6  ,  then  both  sides  are 
zero.  Assume  y  ^  9,  and  let  y0  =  y  l/y/l   ,  so  that  ||y0ll  =  1.  Then  yQ 
forms  an  orthonormal  sequence  of  one  term.  Hence  by  part  (2)  of  Theorem  9 
above,  ((x|y0)|  c  ±  [Ixll  .  Replacing  y0  by  y  ||yH  ~A  ,  the  theorem  follows. 
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It  has  been  shown  that  every  orthonormal  sequence  is  linearly  inde- 
pendent. A  construction  process  will  now  be  described  by  which  an  ortho- 
normal  sequence  may  be  derived  from  a  given  linearly  independent  sequence. 

LEMMA  5.  In  the  notation  of  Theorem  9,  let  y  =   JE\(x!xk)xk  ,  and 
let  z  =  x  -  y  .  Then  (zfxk)  =  0  for  all  k  ^  n  . 

Proof: 

(z|xk)  =  (x|xk)  -  (yjxk) 

■  (xjxk)  -  (21  ,n  (x|Xj)Xj|xk) 

-  (xjxk)  -  (xlxk)(xkJxk)  =  0, 

f 

where  the  sum  of  the  scalar  products  given  by  (y(xk)  reduces  to  the 
one  term  by  virtue  of  the  fact  that  [x,  }  is  orthonormal. 

THEOREM  10.   (The  Gram-Schmidt  Orthonormalization  Process.)  If 
y-j_,yp,...,  is  a  sequence  of  linearly  independent  vectors  in  a  pre-Hilbert 
space  r  ,   then  there  exists  an  orthonormal  sequence  x-,,x?,...  in  "P   such 
that  iy]_,...,y  ]  ■  [x  , ...,x  ]  for  all  n  (the  yk  and  the  xk  generate 
the  same  subspaces). 

Proof: 


Def: 


ine  x-|_  ■  y^  //  y-,  //  "  .  Clearly  x-,  and  y,  generate  the  same 
subspace.  Assume  inductively  that  orthonormal  vectors  x, ,x-,...,x 
are  given,  such  that  f  y^, . . .  ,yn_.jj  =  [xp  ...,xn-]   .  Let 


z  = 


yn  -  2_,  ^ynlxk)xk  •  By  the  iemma>  zJ-xk  for  a11  k  -  n  •  if  z  =  e  , 

then  yn  is  a  linear  combination  of  x  ,  ...,x    ,  hence  also  of  y-^  ...,y 
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which  contradicts  the  independence  of  fyn}  •  Thus  z  ^  6  ,  and  it  is 
permissible  to  define  x  =  z  \\z  {\~     .  Now 


Hi  Ak*k  -  Znk".'(Akxk  +  Anyn  -  An  £  *..(ynK)xk 
■  r^^k-^n^V]  xk  +  xnyn 

=   ^  jUk^  +  A  y   by  the  induction  hypothesis. 
Thus  [x,,...,xn]  =  [y  ,'..., y  ]  ,  which  completes  the  induction. 

Since  every  finite  dimensional  pre-Hilbert  space  has  a  basis,  an 
immediate  consequence  of  Theorem  10  is  that  every  finite-dimensional 
pre-Hilbert  space  has  a  basis  which  consists  of  orthonormal  vectors. 
This  property  is  essential  for  the  proof  of  the  following  theorem. 

THEOREM  11.  Every  finite -dimensional  pre-Hilbert  space  r   is 
complete,  hence  is  a  Hilbert  space. 

Proof: 

Let  r   be  of  dimension  n,  and  let  x1,...,xn  be  a  basis  for  P 
consisting  of  orthonormal  vectors.  If  x  =  £_i  A  icxk>  then  ||xj(   = 

LlUAJAA5     =     ^(Ak|2  by  Theorem  8.     Thus  if  {yj  is  a 

M^A  >  then   i  ym"yp 


Cauchy  sequence  in    P  ,  where  ym  =    £_  ^  ,  X  kxk  '   then   (I  ym  "  yi 
■^—  i  I   Afc        "  Ak  I       +0  as  m,p  ->  ©o     .     In  particular, 
I  Afc  ~  Ai<  f      <      y_  -  yn  II      for  each  fixed  k.     Thus  the  sequence 


>k  '    *  H ym  -yP 

(  A' u  )  >   where  m  in  the  running  index,  is  a  Cauchy  sequence.  The  complex 
numbers  are  complete,  therefore  (  \  ,  )  — >  (X  >)  as  m  — >   <=o   .  For  a 
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fixed  k  and  any    £  >0  ,   there  is  an  Mk(  £  )  such  that    I  Ak      ""  ^  k  I  *  t£ 

for  all  m  >  Mk(  £  )   .     Let  M(  £)  =  max  (Mk(  £ )}  .     Thus  if  y  -    Z_  ,  A^  > 

then      ||ym-y||2=      £1"  I  X?  "  Akl*   <£?£     -    H  £  for  m  >M(<£  ). 
Thus  {y   }  -*  y  .     Clearly  y   €    P  .     Therefore  "P  is  complete. 

HILBERT  SPACE 

A  few  general  properties  of  Hilbert  space  will  now  be  discussed. 
The  first  of  these  is  concerned  with  infinite  suras  of  vectors  in  an 
infinite-dimensional  Hilbert  space.     If  x-^XjjX  , ...   is  a  sequence  of 

,  xk  converges  to  a  limit  x,   then 
this  limit  is  denoted  x  =       2—  i  x-,    . 

THEOREM  12.     If  fcOis  an  infinite  orthonormal  sequence  of  vectors 
in  a  Hilbert  space   7t    and    {A_]     a  sequence  of  scalars  such  that 

2— i  (Xkl      <°°     ,   then  the  sequence     [yn]  ,  where  yn  =  Z,Akxk, 

Loo 
,     \     X,    . 


Proof: 

n+p 


2  V^n 

lK+p"ynll      =     2-^JUj^l 

Z.  J,  |  A 


2 


0 


as  n    ■ — f  00  .     Thus   [yn]  is  a  Cauchy  sequence,   and  {yn}  -*  x  £  rr    since 
Vr  is  complete. 
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LEMMA  6.     In  a  pre-Hilbert  space         , 

(1)  if  {xn}+x  and  {yn\->y  ,  then{(xn|yn)}-Kx|y)    J 

(2)  if  \x^[  and  {yn]    are  Cauchy  sequences  of  vectors,   then  {(xnlyn)|   is  a 
Cauchy  sequence  of  scalars,  hence  is  convergent. 

Proof: 

(1)  For  all  n,    (xnjyn)  -  (x/y)  = (xn  -  x/yn  -  y)  +   (x(yn  -  y)  +(xn  -  x|y) 

Hence 

|(*nW  ~  (x|y)'   *   '(xn  "  xfrn  "  ^1    +   f(x'yn  "  *>(    + '(xn  "  x/y>l 

]jxn  -x|f.//yn  -y|/+'//x/|  -//yn-y// 

+  K  -x|/.  || yj[ 
by  the  Cauchy -Schwarz  inequality.     Since  the  right  side  of  this  inequality 
goes  to  zero  as  n— *<»,    it  follows  that   {(xn|yn)}  — *•  (x[y) 

(2)  fclTj  -  (xm|ym)|   4    K  -  xm(|  .  |yn  -  yml| 

It  will  be  shown  that   every  Cauchy  sequence  is  bounded,   thus  the  right 
side  of  this  inequality  must  go  to  zero  as  n,ra-^«o.     Choose  N  such  that 
(jxjjj  -  x   y     <    1  whenever  m,n    ^    N.     Then 

llx^  =  llxn  -  ^  +  ^11   *  /|xn  -  xN|[  ♦  //xN//<  1  +J/xNU 

for  all  n   >    N.     Choose  M  =  max    [l  -  (^  II  ,  flx^l  ,...,    II  x^  jj     . 
Then     flxj    <  M  for  all  n. 

THEOREM  13.     Let  (xn)  be  an  orthonormal  sequence  of  vectors  in  a 
Hilbert  space   V  .     If  x  =  y>  T  ^  ^\  and  7  ■         Xli/VjX,    in  the  notation 
of  Theorem  12,  then 
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(1)  (x|y)  ■    2_t   AkAk  »  where  this  series  converges  absolutely; 

(2)  (xjxk)  =  Ak  j 

o)  1x1/2=    x:/xk/2  -  i::i(x!xk)|2 . 

Proof: 

(1)     Let  sn  -    JT(  Akxk  and  tn  =        Z,//kxk  ,   so  that   (sn  ]  -x  and 
foj  -^3T  by  definition.     Then  (snJtn)  -*  (x\y)  by  Lemma  6.     Thus 

y  n        — 

Z_  ^ . ,    A  k/^k     • 

Thus  (xjy)  =        2_,   Ak/^k     *     That  the  sequence  is  absolutely  convergent 
follows  by  Lemma  2. 

(2)  'In  (l),'let  //     =  1  and  JJ    =  0  ,   j  /  k  . 

(3)  In  (1),  let  x  =  y,  then  let  Ak  =  (x[xk)  by  (2). 
The  sequence  fxj  need  not  be  orthogonal  in  order  to  give 

L—\  xn   meaning.  For  example,  if  (xj  is  a  sequence  in  a  Hilbert 
space  X   such  that   ]E-7lJxkl|  <    00   ,  then  the  sequence  {yn)  =  [Z.* xk) 

is  a  Cauchy  sequence  ;  (|yn+p  -  ^  j(=  \\£   ^  )|  <  £  ^  (J  ^  by 

Theorem  6.  Thus  ||yn+p  -  yj    ->  Oasn^  .  since  X   is  complete, 
it  follows  that  {yj  -*x  £  ?/  ,  where  x  =    51^  . 

If  lxrJ  is  an  infinite  orthonormal  sequence  in  a  Hilbert  space  74 
then  by  the  Corollary  to  Theorem  9,  the  scallars  Ak  =  (x)xk)  satisfy 
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Y"°h  |2  <  co    for  any  x  €  V .     Thus  by  Theorem  12,  there  is  a 

y  6  #  such  that  y  =   Z_  ,  X  kxk  ,  and  by  Theorem  13,  (yi*k)  ■  Ak 
for  each  k.  Thus  (y  -  x|xk)  =  (y|xk)  -  (x|xk)  =  Ak  -  Ak  =  °  for  each' 
k.  However,  this  would  imply  that  x  =  y  iff  z  «#  and  zlxk  for  each  k 
implies  that  z  =  9  .  This  leads  to  the  following  definition:  a  set  of 
vectors  in  a  pre-Hilbert  space  V   is  said  to  be  total  if  the  only  vector 
in  ft   which  is  orthogonal  to  every  vector  in  ■&   is  the  vector  0  . 
Similarly,  a  sequence  [x^  in  V   is  a  total  sequence  if  z-bck  for  all  k 
implies  that  z  =  6  . 

EXAMPLE  9.  The  set  of  all  vectors  in  a  pre-Hilbert  space  r  is  a 
total  set,  since  z  lx  for  all  x  £  f  implies  in  particular  that  zlz, 

hence  z  =  8  . 

p 

EXAMPLE  10.  In  the  Hilbert  space  1  ,  the  sequence  whose  n-th  term 

is  en  =  ( OjO,.., 0,1,0.. . )  ,  where  the  1  is , the  n-th  component,  is  a  total 
sequence  of  vectors.  This  sequence  is  also  a  total  sequence  in  the  pre- 
Hilbert  space  of  finitely  nonzero  sequences. 

A  sequence  of  vectors  |"x  j  ,  either  finite  or  infinite,  in  a  Hilbert 
space  7/   is  said  to  be  an  orthonormal  basis  for  ft    provided  (i)  [x  ] is 

orthonormal,  and  (ii)  {xn{  is  total.  The  sequence  described  in  Example  10 

p 

is  an  orthonormal  basis  for  1  .  It  is  called  the  canonical  orthonormal 

basis  of  1  . 

If  r  is  a  pre-Hilbert  space  possessing  a  finite  sequence  x-,,  ...,x 
which  is  orthonormal  and  total,  then  by  Lemma  5,  x  -  2—\    (x|xk)xk  is 

orthogonal  to  each  xkj  thus  x  =  2—  ,  (x\xk)xk  .  It  follows  that  the 

xk  form  a  basis  for  r    ,  and  r   has  dimension  n.  The  following  theorem 
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shov/s  that  a  similar  result  holds  for  infinite-dimensional  spaces. 
THEOREM  II4.  If  jxjj  is  an  infinite  orthonormal  sequence  in  a 
Hilbert  space  V   ,  then  the  following  are  equivalent: 

(1)  (x^  forms  an  orthonormal  basis  of  7/  . 

(2)  ^~|(x[xk)[2  =  II  x  \\  2   for  each  x6%   . 

(3)  H,  (xlxk)xk  =  x  for  each  x  6  ^  ' 
Proof: 

Statement  (1)  implies  statement  (3)  by  Theorems  12  and  13.  State- 
ment (3)  implies  statement  (2)  by  Theorem  13,  part  (3).  Statement  (2) 
implies  statement  (1),  since  if  (xfxk)  =  0  for  all  k,  then  (|xl|  =  0, 
hence  x  =  6  . 

Not  every  Hilbert'  space  has  an  orthonormal  basis.  Those  that  do 
certainly  contain  a  total  sequence,  and  such  Hilbert  spaces  are  said  to 
be  separable.  This  condition  is  not  only  necessary  in  order  that  a 
Hilbert  space  have  an  orthonormal  basis,  but  it  is  also  sufficient. 

THEOREM  15.  The  following  statements  about  a  Hilbert  space  W   are 
equivalent: 

(1)  //is  separable. 

(2)  H    has  an  orthonormal  basis  {xn}  . 
Proof: 

(2)  implies  (l)  by  the  definitions  of  an  orthonormal  basis  and  of 
separable  space.   (1)  implies  (2)  :  Let  Z]_,  Zp,z,, . . .  be  a  total  sequence 
in  N  .  Then  there  is  a  linearly  independent  subsequence  y^,  yp,  y-5,..  of 
{zk}  which  generates  the  same  linear  subspace  as  {zk]  .  This  follows  by  a 
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constructive  argument:  if  zk  =  6  for  all  k,  then  let  the  sequence  be  the 
empty  sequence.  This  by  convention  is  taken  to  be  an  independent  sequence 
which  generates  the  subspace  8  .  If  not  zk  =  6  for  all  k,  then  let  k±  be 
the  smallest  index  such  that  zk  J   6  .  Then  for  k  <  k±   ,  zk  =  6  =  0zkl  . 
If  for  k  >k1   each  zk  is  a  scalar  multiple  of  zk  ,  then  let  the  sequence 
{yn]  consist  of  the  single  term  zk  .  If  this  is  not  the  case,  let  k2 
be  the  least  positive  integer  greater  than  k1  such  that  zk  is  not  a 
multiple  of  zk  .  If  each  z^   for  k > k?  is  expressible  as  a  linear  com- 
bination of  zv     ,  z,   ,  then  let  {yn}  consist  of  the  vectors  zk  and 
1    k2  1 

z,   .  If  this  is  not  the  case,  let  z,   denote  the  next  vector  not 
k2  *3 

expressible  as  a  linear  combination  of  zk  ,  zk  ,  and  continue  the 
process.  If  fzk)  is  finite,  then  the  process  will  exhaust  the  sequence, 
yielding  a  finite  independent  subsequence  {zk  }  =  (y ^  }  .  If  /zk}  is  not 
finite,  the  process  continues  inductively.  The  sequence  (possibly 
finite)  which  the  process  yields  is  also  total:  if  a  vector  z  is  ortho- 
gonal to  every  y .  ,  then  it  is  orthogonal  to  every  linear  combination  of 
the  y^;  therefore  it  is  orthogonal  to  each  zk  . 

By  Theorem  10,  there  is  an  orthonormal  sequence  [xA   generating 
the  same  subspace  as  [y^}  .  An  argument  similar  to  the  one  used  to  show 
that  |y-}  is  a  total  sequence  shows  that  fx^j  is  also  total;  which 
completes  the  proof. 

Two  Hilbert  spaces  H   and  M   are  said  to  be  isomorphic  if  there  is 
a  one-one  mapping  T  of  M   onto  K   such  that  for  x,y  €  %  , 
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(i)   (x  +  y)T  ■  xT  -f-yT   ; 
(ii)   (/U)T  =  A(xT)  for  any  \€  C; 
(iii)   (xT(yT)  =(x[y)  . 
The  function  T  is  called  a  Hilbert  soace  isomorphism  of  Jr    onto  1\   . 

Two  types  of  separable  Hilbert  spaces  are  given  in  the  following 
examples.  The  next  theorem  shows  that  up  to  isomorphism,  these  are  the 
only  separable  Hilbert  spaces. 

EXAMPLE  11.   If  Jr   is  a  finite-dimensional  Hilbert  space,  then  every 
basis  is  total,  thus  n    is  separable.   In  particular,  for  any  positive 
integer  n,  the  unitary  space  C      is  an  n-dimensional  separable  Hilbert 
space. 

EXAMPLE  12.  The  Hilbert  space  l2  is  separable,  as  is  shown  by 
Example  10. 

THEOREM  16.  If  }f   is  a  separable  Hilbert  space,  then  )/   is  iso- 
morphic  to  C      or  to  1  ,  according  as  }/  has  finite  dimension  n  or 
has  infinite  dimension. 

Proof: 

First  assume  that  )/    has  dimension  n.  Then  by  Theorem  l£,  M   has 
an  orthonormal  basis  x-^  ...,xn  .  By  Lemma  5,   if  x£/  ,  then 
(x  -  2-.t  (x(xk)xk)J_xk  for  all  k.  The  sequence  xn  is  total;  therefore 

x  =  Z.i  (x(xk)xk  .  Let  xT  =  ((x(xk))  ;  that  is,  xT  =  (A  n)  €  C" 

such  that  X±   =  (xlxj^).  Then  if  x,y€#such  that  x  ^  y,  then  x  -  y  ^  9  . 

Hence  (x  -  y|xk)  /  0  for  some  k,  say  k  =  j.  Thus  (xlx.)  -  (yjx.)  4   0, 

from  which  ((x|xk))  ^  ((y|xk))  .  If  (*n)  €  £n,  then  x  =  21  A^  £  #  . 
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By  the  remark  above,  this  implies  that  Ak  =  (xfxk)  •  therefore  T  is 
one-one  from  %   onto  C       .         . 

(i)  x  +  y  =   -EZ,fxfxk)  xk  +  (yj^^jj 

.    •   £n,Nxk)  +  (yixk)J  xk 
-  z_,  (x  +  yixk)xk 

Thus  (x  +  y)T  =  ((x  +  y(xk))  =  ((x|xk)  +  (y|xk)) 

-  ((xlxk))  +  ((y[xk)) 

=  xT  +  yT  . 
(ii)   Ax=   All"(x|xk)xk  =    H!1(Xx|xk)xk 

Thus  (Ax)T  =  ((Ax[xk))  =  (X(x|xk)) 

=  A((xlxk))  =  X(xT)   . 
(iii)  (xjy)=(  Ev.,(xlxk)xk  J   £  "w(y[x.)Xj) 

=  L^  Hj.,(x/xk)(yixj)(xklxj) 

r-  n      

=  Zk,((xfxk)(y(xk) 

=   (xTlyT)  . 

If  on  the  other  hand  hf   has  infinite  dimension,  then  U     is  isomorphic 
to  1   .  Again  an  orthonormal  basis  fxn}  is  given  by  Theorem  1$.  By  the 
Corollary  of  Theorem  9,  H"|(xjxk)  (  2  <   oo   .  thus  it  is  meaningful  to 

define  xT  =  ((x[xk))  .  If  x  /  y,  then  (x  -  y|Xj)  ^  o  for  some  J  ;  thus 
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xT  4  yT  .  Let  (^k)  f  l2  ,  and  define  x  -  Z." \k\  •  B?  Theorem  13, 
Aj  =  (x|x.=  )  .  Thus  xT  =  (  A  k)  '  and  T  is  one-°ne  from  M  on^°  !  • 
If  x,y  £  7/  ,  then 

(i)  (x  +  y|xk)  -  (xlxk)  +  (y|xk)  for  all  k,  hence  (x  +  y)T  =  xT  +  yT  . 

(ii)  (Ax|xk)  =   A(x|xk)  ,  thus  (  Xx)T  =  A(xT)  . 

(iii)     By  Theorem  lb,   x  =       HT(xlxk)xk  and  y  =       £.,  (yjxk)xk     .     Thus 

(xly)  =      2_  k,,  Z  r-,  (x|xk)(y|x;j)(xkjxj) 

II,  (x|xk)(y|xk) 

(xT|yT)      . 

LEMMA  7.  If  a  pre-Hilbert  space  P   is  isomorphic  to  a  Hilbert 
space  %   in  the  sense  that  there  exists  a  mapping  T  of  /  onto  jf   which 
satisfies  the  conditions  for  a  Hilbert  space  isomorphism,  then  r   is 
itself  a  Hilbert  space. 

Proof : 

Let  the  sequence  |x_}  be  a  Cauchy  sequence  in    r   .     Then  by  condition 
(iii)  for  an  isomorphism,       J/x^T  -  xRT  \\      =     lxm  -  *n  II     <  £>   ".m  >  N(£)   . 

Thus   {xmT)is  a  Cauchy  sequence  in   U  ,   and   {x^}   — yy    £  %    .     But  there 

9  9 

is  an  x  6  f    such  that  xT  =  y.     Hence  Uxn  -  x  II      =  (|xnT  -  xT  H         <    £ 

for  n  >  N,   and  {x^}  — *■  x  . 

The  term  pre-Hilbert  space  has  been  used  to  designate  an  inner- 
product  linear  vector  space  which  is  not  necessarily  complete  in  the 
metric  defined  by  this  inner  product.  If  the  space  r    is  not  complete, 
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then  the  set  of  vectors  in  r   may  be  enlarged  by  the  ordinary  process 
of  metric  space  completion  to  a  new  set  "P  *   in  such  a  manner  that  P* 

forms  a  complete  metric  space.  Furthermore,  this  may  be  done  in  such 

■ft  *  >i 

a  way  that  A"   forms  an  inner-product  space  which  contains  r   (iso- 

morphically)  as  a  subspace.    That  is,  any  pre-Hilbert  space  can  be 
extended  to  a  Hilbert  space.  This  justifies  the  use  of  the  term  pre-Hilbert 
space  and  explains  the  importance  of  these  spaces  in  the  theory  of  Hilbert 
space. 

The  only  infinite-dimensional  Hilbert  space  which  has  been  considered 

2 
is  1  .  The  following  theorem  describes  a  more  general  Hilbert  space 

2 

which  includes  both  1  and  the  unitary  spaces  as  special  cases. 

THEOREM  17.   (The  Hilbert  Space  l2  fQ ]  .)  Let  Q  be  any  nonempty 
set  of  elements.  Let  1  [QJ  denote  the  class  of  scalar-valued  functions 
x  defined  on  Q,  such  that  the  set  of  allqe  Q  for  which  x(q)  ^  0  is  either 
finite  or  countably  infinite,  and  having  the  property  that 

2—    fx(q)l   <   °°  .   Define  the  sum  of  elements  of  l2  f  oj  and 
scalar  multiplication  as  in  Example  2,  and  for  x,y  £  l2  rQ  ]  ,  (x(y)  = 


4-,  x(q)y(q)  •  Then  l2  [q]   is  a  Hilbert  space. 

9c  Q 


Proof: 


Since  the  union  of  two  countable  sets  is  itself  a  countable  set, 
the  sum  of  two  elements  of  l2  [q]  is  an  element  of  l2  Cq]  .  Clearly 
1     IqJ   is  a  linear  vector  space  which  foras  a  pre-Hilbert  space  with  the 


-•■See  Angus  E.   Taylor,   Introduction  to  Functional  Analysis,   pp.   71-7$. 
98-99,   119.  
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inner  product  as  defined.     Thus  it  is  necessary  only  to  show  that 
1     [Qj  is  complete. 

If  Q  is  itself  finite  or  countably  infinite,   then  l2  [q]  is  iso- 
morphic either  to  one  of  the  spaces     C  n  or  to  l2  ,   and  is  therefore 
complete  by  Lemma  7.      If  on  the   other  hand  Q  is  not  countable,    then 
it  must  be  shown  that  a  given  Cauchy  sequence  (xj  in  l2  [q]  converges 
to  an  element  of  12[q]    .     By  hypothesis,   for  £>0  there  is  an 
N(6  )   >  0  such  that 

K  "  *nl  2  "    (*m  -  *nl*ra  -  ^)  -         T    jxjq)   -  xn(q)|  2    <£ 

whenever  m,n  >   N  .  In  particular,  for  a  fixed  q  e  Q,  then 

J^Cq)  -  ^(q)!2  <  X  1  x  (q)  -  x(q)|2  <  £  whenever  n,m  >  N. 

Thus  the  sequence  of  complex  numbers  {xm(q)j  is  a  Cauchy  sequence, 
and  must  therefore  converge  to  an  element  of  C  ,  which  may  be  denoted 
x(q)  .  Now  letting  q  range  over  Q,  the  scalar-valued  function  x(q)  is 
defined  for  all  q£Q.  Let  ?±  =       (q€Q:  x±(qtf   o}  ,  and 
P  =  {q^Q:  x(q)  ^  o}  .  If  q'e  P,  and  if  xn(q/)  =  0  for  an 
infinite  number  of  n,  then  necessarily  x(q')  =  0,  which  contradicts 
the  choice  of  q' .  Thus  xk(q' )  ^  0  for  some  k.  Hence  PcLiP.   . 
Since  the  countable  union  of  countable  sets  is  itself  countable,  it 
follows  that  P  is  countable. 

From  the  hypothesis   JZ.   U^Cq)/2  <  oo  ,  where  P.  is  countable, 
it  follows  that  each  x±(q)   generates  an  absolutely  square- summable  sequence, 
an  element  of  l2  .  The  completeness  of  I2  assures  that  the  sequence 


32 


generated  by  the  limit  x(q),  where  q  €  p,  must  also  be  an  element  of  l2  . 
Thus 

X,  I  x(q)  1  2  =     X  I  x(q)  |  2  <  «p    and  x(q)  e  l2  CQ 1  . 

Therefore  1  [Qj  is  complete  and  is  a  Hilbert  space. 

It  is  interesting  to  note  that  the  set  of  functions  fx  (q):  p  €.    q} 
defined  by  x^(q)  =  0  if  q  /  p,  xp(P)  =  1>   is  an  orthonormal  set  having 
the  same  cardinal  number  as  the  set  Q  itself.  Thus  there  exists  a  Hilbert 
space  containing  an  orthonormal  set  whose  cardinal  number  is  any  given 
infinite  cardinal.  If  Q  is  an  uncountably  infinite  set,  then  it  can  be 
shown  that  1  LQ]  is  a  Hilbert  space  which  is  not  separable. 
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The  purpose  of  this,  report  is  to  study  the  variety  of  linear  vector 
space  known  as  Hilbert  space.  The  original  example  of  this  type  of  space 
was  first  studied  extensively  by  the  German  mathematician  David  Hilbert, 
in  connection  with  his  work  with  integral  equations.  This  space  is  known 
as  classical  Hilbert  space.  It  was  abstracted  by  J.  von  Neumann  to  the 
variety  of  linear  vector  space  called  Hilbert  space. 

The  initial  sections  of  the  report  contain  basic  properties  of  linear 
vector  spaces  including  a  particular  type  of  complex-valued  function, 
called  an  inner  product,  on  pairs  of  elements  of  these  spaces.  The  norm 
of  each  element  is  defined  in  terms  of  this  function,  and  a  metric  on 
the  space  is  defined  in  terms  of  this  norm.  As  metric  spaces,  the  vector 
spaces  may  or  may  not  possess  the  topological  property  of  completeness, 
in  the  sense  that  every  Cauchy  sequence  of  vectors  converges  in  the  metric 
to  a  vector  of  the  space.  The  vector  spaces  with  an  inner  product  which 
possess  this  property  of  completeness  are  known  as  Hilbert  spaces. 

One  of  the  most  important  concepts  in  the  theory  of  inner-product 
spaces  (called  pre-Hilbert  spaces),  is  that  of  orthogonality.  Two  vectors 
are  said  to  be  orthogonal  if  their  inner  product  is  zero.  A  sequence  of 
vectors  is  said  to  be  total  provided  that  each  vector  has  norm  one,  that 
each  pair  of  distinct  vectors  is  orthogonal,  and  that  the  only  vector 
which  is  orthogonal  to  every  vector  of  the  sequence  is  the  null  vector. 
A  Hilbert  space  containing  such  a  sequence  is  said  to  be  separable. 
Theorem  16  states  that  every  finite-dimensional  separable  Hilbert  space 
is  isomorphic  to  the  unitary  space,  described  in  Example  h,   of  the  same 
dimension,  and  every  infinite-dimensional  separable  Hilbert  space  is 


isomorphic  to  classical  Hilbert  space.  The  final  theorem  describes  a 
class  of  Hilbert  spaces  defined  on  arbitrary  sets  Q.  For  particular  Q, 
this  space  may  be  separable  and  either  finite  or  infinite,  or  infinite 
and  not  separable.     , 


